Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
1.
Cell Genom ; 4(4): 100538, 2024 Apr 10.
Article in English | MEDLINE | ID: mdl-38565144

ABSTRACT

Nearly all trait-associated variants identified in genome-wide association studies (GWASs) are noncoding. The cis regulatory effects of these variants have been extensively characterized, but how they affect gene regulation in trans has been the subject of fewer studies because of the difficulty in detecting trans-expression quantitative loci (eQTLs). We developed trans-PCO for detecting trans effects of genetic variants on gene networks. Our simulations demonstrate that trans-PCO substantially outperforms existing trans-eQTL mapping methods. We applied trans-PCO to two gene expression datasets from whole blood, DGN (N = 913) and eQTLGen (N = 31,684), and identified 14,985 high-quality trans-eSNP-module pairs associated with 197 co-expression gene modules and biological processes. We performed colocalization analyses between GWAS loci of 46 complex traits and the trans-eQTLs. We demonstrated that the identified trans effects can help us understand how trait-associated variants affect gene regulatory networks and biological pathways.


Subject(s)
Genome-Wide Association Study , Quantitative Trait Loci , Quantitative Trait Loci/genetics , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Phenotype
2.
Genome Biol ; 21(1): 211, 2020 08 24.
Article in English | MEDLINE | ID: mdl-32831138

ABSTRACT

The observation that disease-associated genetic variants typically reside outside of exons has inspired widespread investigation into the genetic basis of transcriptional regulation. While associations between the mRNA abundance of a gene and its proximal SNPs (cis-eQTLs) are now readily identified, identification of high-quality distal associations (trans-eQTLs) has been limited by a heavy multiple testing burden and the proneness to false-positive signals. To address these issues, we develop GBAT, a powerful gene-based pipeline that allows robust detection of high-quality trans-gene regulation signal.


Subject(s)
Gene Expression Regulation , Genetic Testing/methods , Genome-Wide Association Study , Gene Expression Profiling , Gene Regulatory Networks , Genotype , Humans , Polymorphism, Single Nucleotide , RNA, Messenger
3.
Hum Mol Genet ; 29(7): 1057-1067, 2020 05 08.
Article in English | MEDLINE | ID: mdl-31595288

ABSTRACT

Regulatory variation plays a major role in complex disease and that cell type-specific binding of transcription factors (TF) is critical to gene regulation. However, assessing the contribution of genetic variation in TF-binding sites to disease heritability is challenging, as binding is often cell type-specific and annotations from directly measured TF binding are not currently available for most cell type-TF pairs. We investigate approaches to annotate TF binding, including directly measured chromatin data and sequence-based predictions. We find that TF-binding annotations constructed by intersecting sequence-based TF-binding predictions with cell type-specific chromatin data explain a large fraction of heritability across a broad set of diseases and corresponding cell types; this strategy of constructing annotations addresses both the limitation that identical sequences may be bound or unbound depending on surrounding chromatin context and the limitation that sequence-based predictions are generally not cell type-specific. We partitioned the heritability of 49 diseases and complex traits using stratified linkage disequilibrium (LD) score regression with the baseline-LD model (which is not cell type-specific) plus the new annotations. We determined that 100 bp windows around MotifMap sequenced-based TF-binding predictions intersected with a union of six cell type-specific chromatin marks (imputed using ChromImpute) performed best, with an 58% increase in heritability enrichment compared to the chromatin marks alone (11.6× vs. 7.3×, P = 9 × 10-14 for difference) and a 20% increase in cell type-specific signal conditional on annotations from the baseline-LD model (P = 8 × 10-11 for difference). Our results show that TF-binding annotations explain substantial disease heritability and can help refine genome-wide association signals.


Subject(s)
Chromatin/genetics , Genetic Diseases, Inborn/genetics , Molecular Sequence Annotation , Transcription Factors/genetics , Binding Sites/genetics , Computational Biology , Gene Expression Regulation/genetics , Genetic Diseases, Inborn/classification , Genetic Diseases, Inborn/pathology , Humans , Linkage Disequilibrium/genetics , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide/genetics , Protein Binding/genetics
4.
Nat Genet ; 51(8): 1295, 2019 Aug.
Article in English | MEDLINE | ID: mdl-31273336

ABSTRACT

In the version of the paper initially published, information on competing interests for author Benjamin M. Neale was missing. The 'Competing interests' statement should have included the sentence 'B.M.N. is on the Scientific Advisory Board of Deep Genomics'.

5.
Cell ; 177(4): 1022-1034.e6, 2019 05 02.
Article in English | MEDLINE | ID: mdl-31051098

ABSTRACT

Early genome-wide association studies (GWASs) led to the surprising discovery that, for typical complex traits, most of the heritability is due to huge numbers of common variants with tiny effect sizes. Previously, we argued that new models are needed to understand these patterns. Here, we provide a formal model in which genetic contributions to complex traits are partitioned into direct effects from core genes and indirect effects from peripheral genes acting in trans. We propose that most heritability is driven by weak trans-eQTL SNPs, whose effects are mediated through peripheral genes to impact the expression of core genes. In particular, if the core genes for a trait tend to be co-regulated, then the effects of peripheral variation can be amplified such that nearly all of the genetic variance is driven by weak trans effects. Thus, our model proposes a framework for understanding key features of the architecture of complex traits.


Subject(s)
Gene Expression Regulation/genetics , Heredity/genetics , Multifactorial Inheritance/genetics , Databases, Genetic , Gene Expression/genetics , Gene Expression Profiling/methods , Genetic Variation/genetics , Genome-Wide Association Study , Humans , Models, Theoretical , Phenotype , Polymorphism, Genetic/genetics , Quantitative Trait Loci/genetics
6.
Nat Genet ; 50(7): 1041-1047, 2018 07.
Article in English | MEDLINE | ID: mdl-29942083

ABSTRACT

There is increasing evidence that many risk loci found using genome-wide association studies are molecular quantitative trait loci (QTLs). Here we introduce a new set of functional annotations based on causal posterior probabilities of fine-mapped molecular cis-QTLs, using data from the Genotype-Tissue Expression (GTEx) and BLUEPRINT consortia. We show that these annotations are more strongly enriched for heritability (5.84× for eQTLs; P = 1.19 × 10-31) across 41 diseases and complex traits than annotations containing all significant molecular QTLs (1.80× for expression (e)QTLs). eQTL annotations obtained by meta-analyzing all GTEx tissues generally performed best, whereas tissue-specific eQTL annotations produced stronger enrichments for blood- and brain-related diseases and traits. eQTL annotations restricted to loss-of-function intolerant genes were even more enriched for heritability (17.06×; P = 1.20 × 10-35). All molecular QTLs except splicing QTLs remained significantly enriched in joint analysis, indicating that each of these annotations is uniquely informative for disease and complex trait architectures.


Subject(s)
Disease/genetics , Multifactorial Inheritance , Quantitative Trait Loci , Genome-Wide Association Study/methods , Humans , Phenotype , Polymorphism, Single Nucleotide , Quantitative Trait, Heritable
7.
Proc Natl Acad Sci U S A ; 115(2): 379-384, 2018 01 09.
Article in English | MEDLINE | ID: mdl-29279374

ABSTRACT

A major challenge in evaluating the contribution of rare variants to complex disease is identifying enough copies of the rare alleles to permit informative statistical analysis. To investigate the contribution of rare variants to the risk of type 2 diabetes (T2D) and related traits, we performed deep whole-genome analysis of 1,034 members of 20 large Mexican-American families with high prevalence of T2D. If rare variants of large effect accounted for much of the diabetes risk in these families, our experiment was powered to detect association. Using gene expression data on 21,677 transcripts for 643 pedigree members, we identified evidence for large-effect rare-variant cis-expression quantitative trait loci that could not be detected in population studies, validating our approach. However, we did not identify any rare variants of large effect associated with T2D, or the related traits of fasting glucose and insulin, suggesting that large-effect rare variants account for only a modest fraction of the genetic risk of these traits in this sample of families. Reliable identification of large-effect rare variants will require larger samples of extended pedigrees or different study designs that further enrich for such variants.


Subject(s)
Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation , Mexican Americans/genetics , Diabetes Mellitus, Type 2/ethnology , Diabetes Mellitus, Type 2/pathology , Family Health , Female , Gene Frequency , Genetic Predisposition to Disease/ethnology , Genome-Wide Association Study/methods , Genotype , Humans , Male , Pedigree , Phenotype , Quantitative Trait Loci/genetics , Whole Genome Sequencing/methods
8.
PLoS One ; 12(11): e0185487, 2017.
Article in English | MEDLINE | ID: mdl-29091727

ABSTRACT

The contemporary Japanese populations largely consist of three genetically distinct groups-Hondo, Ryukyu and Ainu. By principal-component analysis, while the three groups can be clearly separated, the Hondo people, comprising 99% of the Japanese, form one almost indistinguishable cluster. To understand fine-scale genetic structure, we applied powerful haplotype-based statistical methods to genome-wide single nucleotide polymorphism data from 1600 Japanese individuals, sampled from eight distinct regions in Japan. We then combined the Japanese data with 26 other Asian populations data to analyze the shared ancestry and genetic differentiation. We found that the Japanese could be separated into nine genetic clusters in our dataset, showing a marked concordance with geography; and that major components of ancestry profile of Japanese were from the Korean and Han Chinese clusters. We also detected and dated admixture in the Japanese. While genetic differentiation between Ryukyu and Hondo was suggested to be caused in part by positive selection, genetic differentiation among the Hondo clusters appeared to result principally from genetic drift. Notably, in Asians, we found the possibility that positive selection accentuated genetic differentiation among distant populations but attenuated genetic differentiation among close populations. These findings are significant for studies of human evolution and medical genetics.


Subject(s)
Asian People , Polymorphism, Single Nucleotide , Genome-Wide Association Study , Humans , Japan , Multigene Family
9.
Nat Genet ; 49(10): 1421-1427, 2017 Oct.
Article in English | MEDLINE | ID: mdl-28892061

ABSTRACT

Recent work has hinted at the linkage disequilibrium (LD)-dependent architecture of human complex traits, where SNPs with low levels of LD (LLD) have larger per-SNP heritability. Here we analyzed summary statistics from 56 complex traits (average N = 101,401) by extending stratified LD score regression to continuous annotations. We determined that SNPs with low LLD have significantly larger per-SNP heritability and that roughly half of this effect can be explained by functional annotations negatively correlated with LLD, such as DNase I hypersensitivity sites (DHSs). The remaining signal is largely driven by our finding that more recent common variants tend to have lower LLD and to explain more heritability (P = 2.38 × 10-104); the youngest 20% of common SNPs explain 3.9 times more heritability than the oldest 20%, consistent with the action of negative selection. We also inferred jointly significant effects of other LD-related annotations and confirmed via forward simulations that they jointly predict deleterious effects.


Subject(s)
Genetic Variation/genetics , Linkage Disequilibrium , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide , Selection, Genetic , Alleles , Chi-Square Distribution , Datasets as Topic , Genetic Fitness , Humans , Models, Genetic , Molecular Sequence Annotation
10.
Am J Hum Genet ; 100(4): 605-616, 2017 Apr 06.
Article in English | MEDLINE | ID: mdl-28343628

ABSTRACT

Genetic variants that modulate gene expression levels play an important role in the etiology of human diseases and complex traits. Although large-scale eQTL mapping studies routinely identify many local eQTLs, the molecular mechanisms by which genetic variants regulate expression remain unclear, particularly for distal eQTLs, which these studies are not well powered to detect. Here, we leveraged all variants (not just those that pass stringent significance thresholds) to analyze the functional architecture of local and distal regulation of gene expression in 15 human tissues by employing an extension of stratified LD-score regression that produces robust results in simulations. The top enriched functional categories in local regulation of peripheral-blood gene expression included coding regions (11.41×), conserved regions (4.67×), and four histone marks (p < 5 × 10-5 for all enrichments); local enrichments were similar across the 15 tissues. We also observed substantial enrichments for distal regulation of peripheral-blood gene expression: coding regions (4.47×), conserved regions (4.51×), and two histone marks (p < 3 × 10-7 for all enrichments). Analyses of the genetic correlation of gene expression across tissues confirmed that local regulation of gene expression is largely shared across tissues but that distal regulation is highly tissue specific. Our results elucidate the functional components of the genetic architecture of local and distal regulation of gene expression.


Subject(s)
Gene Expression Regulation , Anxiety/genetics , Computer Simulation , Depression/genetics , Humans , Linkage Disequilibrium , Organ Specificity , Quantitative Trait Loci , Regression Analysis , Twins/genetics
11.
Eur J Hum Genet ; 25(4): 499-508, 2017 04.
Article in English | MEDLINE | ID: mdl-28098149

ABSTRACT

The Asian Diversity Project (ADP) assembled 37 cosmopolitan and ethnic minority populations in Asia that have been densely genotyped across over half a million markers to study patterns of genetic diversity and positive natural selection. We performed population structure analyses of the ADP populations and divided these populations into four major groups based on their genographic information. By applying a highly sensitive algorithm haploPS to locate genomic signatures of positive selection, 140 distinct genomic regions exhibiting evidence of positive selection in at least one population were identified. We examined the extent of signal sharing for regions that were selected in multiple populations and observed that populations clustered in a similar fashion to that of how the ancestry clades were phylogenetically defined. In particular, populations predominantly located in South Asia underwent considerably different adaptation as compared with populations from the other geographical regions. Signatures of positive selection present in multiple geographical regions were predicted to be older and have emerged prior to the separation of the populations in the different regions. In contrast, selection signals present in a single population group tended to be of lower frequencies and thus can be attributed to recent evolutionary events.


Subject(s)
Asian People/genetics , Genetic Variation , Population/genetics , Selection, Genetic , Asia , Evolution, Molecular , Genotype , Humans
12.
Sci Rep ; 6: 21198, 2016 Feb 16.
Article in English | MEDLINE | ID: mdl-26879527

ABSTRACT

Abilities related to musical aptitude appear to have a long history in human evolution. To elucidate the molecular and evolutionary background of musical aptitude, we compared genome-wide genotyping data (641 K SNPs) of 148 Finnish individuals characterized for musical aptitude. We assigned signatures of positive selection in a case-control setting using three selection methods: haploPS, XP-EHH and FST. Gene ontology classification revealed that the positive selection regions contained genes affecting inner-ear development. Additionally, literature survey has shown that several of the identified genes were known to be involved in auditory perception (e.g. GPR98, USH2A), cognition and memory (e.g. GRIN2B, IL1A, IL1B, RAPGEF5), reward mechanisms (RGS9), and song perception and production of songbirds (e.g. FOXP1, RGS9, GPR98, GRIN2B). Interestingly, genes related to inner-ear development and cognition were also detected in a previous genome-wide association study of musical aptitude. However, the candidate genes detected in this study were not reported earlier in studies of musical abilities. Identification of genes related to language development (FOXP1 and VLDLR) support the popular hypothesis that music and language share a common genetic and evolutionary background. The findings are consistent with the evolutionary conservation of genes related to auditory processes in other species and provide first empirical evidence for signatures of positive selection for abilities that contribute to musical aptitude.


Subject(s)
Aptitude , Genome, Human , Genome-Wide Association Study , Music , Selection, Genetic , Adolescent , Adult , Aged , Aged, 80 and over , Chromosome Mapping , Female , Gene Ontology , Genomics/methods , Humans , Male , Middle Aged , Young Adult
13.
Sci Rep ; 5: 17855, 2015 Dec 09.
Article in English | MEDLINE | ID: mdl-26648100

ABSTRACT

Japan has often been viewed as an Asian country that possesses a genetically homogenous community. The basis for partitioning the country into prefectures has largely been geographical, although cultural and linguistic differences still exist between some of the districts/prefectures, especially between Okinawa and the mainland prefectures. The Major Histocompatibility Complex (MHC) region has consistently emerged as the most polymorphic region in the human genome, harbouring numerous biologically important variants; nevertheless the presence of population-specific long haplotypes hinders the imputation of SNPs and classical HLA alleles. Here, we examined the extent of genetic variation at the MHC between eight Japanese populations sampled from Okinawa, and six other prefectures located in or close to the mainland of Japan, specifically focusing at the haplotypes observed within each population, and what the impact of any variation has on imputation. Our results indicated that Okinawa was genetically farther to the mainland Japanese than were Gujarati Indians from Tamil Indians, while the mainland Japanese from six prefectures were more homogeneous than between northern and southern Han Chinese. The distribution of haplotypes across Japan was similar, although imputation was most accurate for Okinawa and several mainland prefectures when population-specific panels were used as reference.


Subject(s)
Asian People/genetics , Chromosome Mapping , Genetic Variation , Genetics, Population , HLA Antigens/genetics , Haplotypes , Alleles , Evolution, Molecular , Humans , Japan , Polymorphism, Single Nucleotide , Principal Component Analysis
15.
Hum Genet ; 134(4): 375-92, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25634076

ABSTRACT

The indigenous populations from Peninsular Malaysia, locally known as Orang Asli, continue to adopt an agro-subsistence nomadic lifestyle, residing primarily within natural jungle habitats. Leading a hunter-gatherer lifestyle in a tropical jungle environment, the Orang Asli are routinely exposed to malaria. Here we surveyed the genetic architecture of individuals from four Orang Asli tribes with high-density genotyping across more than 2.5 million polymorphisms. These tribes reside in different geographical locations in Peninsular Malaysia and belong to three main ethno-linguistic groups, where there is minimal interaction between the tribes. We first dissect the genetic diversity and admixture between the tribes and with neighboring urban populations. Later, by implementing five metrics, we investigated the genome-wide signatures for positive natural selection of these Orang Asli, respectively. Finally, we searched for evidence of genomic adaptation to the pressure of malaria infection. We observed that different evolutionary responses might have emerged in the different Orang Asli communities to mitigate malaria infection.


Subject(s)
Disease Resistance/genetics , Malaria/genetics , Population Groups/genetics , Selection, Genetic , Adaptation, Biological/genetics , Cadherins/genetics , Genome-Wide Association Study , Heme Oxygenase-1/genetics , Humans , Lymphotoxin-alpha/genetics , Malaysia/ethnology , Nitric Oxide Synthase Type II/genetics , Polymorphism, Single Nucleotide , Transcriptome , Tumor Necrosis Factor-alpha/genetics , fas Receptor/genetics
16.
PLoS One ; 9(10): e110974, 2014.
Article in English | MEDLINE | ID: mdl-25337808

ABSTRACT

Body fat deposition and distribution differ between East Asians and Europeans, and for the same level of obesity, East Asians are at higher risks of Type 2 diabetes (T2D) and other metabolic disorders. This observation has prompted the reclassifications of body mass index thresholds for the definitions of "overweight" and "obese" in East Asians. However, the question remains over what evolutionary mechanisms have driven the differences in adiposity morphology between two population groups that shared a common ancestor less than 80,000 years ago. The Thrifty Gene hypothesis has been suggested as a possible explanation, where genetic factors that allowed for efficient food-energy conversion and storage are evolutionarily favoured by conferring increased chances of survival and fertility. Here, we leveraged on the existing findings from genome-wide association studies and large-scale surveys of positive natural selection to evaluate whether there is currently any evidence to support the Thrifty Gene hypothesis. We first assess whether the existing genetic associations with obesity and T2D are located in genomic regions that are reported to be under positive selection, and if so, whether the risk alleles sit on the extended haplotype forms. In addition, we interrogate whether these risk alleles are the derived forms that differ from the ancestral alleles, and whether there is significant evidence of population differentiation at these SNPs between East Asian and European populations. Our systematic survey did not yield conclusive evidence to support the Thrifty Gene hypothesis as a possible explanation for the differences observed between East Asians and Europeans.


Subject(s)
Selection, Genetic , Asian People/genetics , China , Chromosomes, Human, Pair 7/genetics , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , HapMap Project , Haplotypes , Humans , Models, Genetic , Multigene Family , Polymorphism, Single Nucleotide
17.
BMC Genet ; 15: 86, 2014 Jul 22.
Article in English | MEDLINE | ID: mdl-25053360

ABSTRACT

BACKGROUND: India is home to many ethnically and linguistically diverse populations. It is hypothesized that history of invasions by people from Persia and Central Asia, who are referred as Aryans in Hindu Holy Scriptures, had a defining role in shaping the Indian population canvas. A shift in spoken languages from Dravidian languages to Indo-European languages around 1500 B.C. is central to the Aryan Invasion Theory. Here we investigate the genetic differences between two sub-populations of India consisting of: (1) The Indo-European language speaking Gujarati Indians with genome-wide data from the International HapMap Project; and (2) the Dravidian language speaking Tamil Indians with genome-wide data from the Singapore Genome Variation Project. RESULTS: We implemented three population genetics measures to identify genomic regions that are significantly differentiated between the two Indian populations originating from the north and south of India. These measures singled out genomic regions with: (i) SNPs exhibiting significant variation in allele frequencies in the two Indian populations; and (ii) differential signals of positive natural selection as quantified by the integrated haplotype score (iHS) and cross-population extended haplotype homozygosity (XP-EHH). One of the regions that emerged spans the SLC24A5 gene that has been functionally shown to affect skin pigmentation, with a higher degree of genetic sharing between Gujarati Indians and Europeans. CONCLUSIONS: Our finding points to a gene-flow from Europe to north India that provides an explanation for the lighter skin tones present in North Indians in comparison to South Indians.


Subject(s)
Ethnicity/genetics , Genetics, Population , Human Migration , Gene Flow , Gene Frequency , Genome, Human , Haplotypes , Humans , India , Polymorphism, Single Nucleotide , Selection, Genetic , White People/genetics
18.
BMC Genomics ; 15: 332, 2014 May 02.
Article in English | MEDLINE | ID: mdl-24885517

ABSTRACT

BACKGROUND: The HUGO Pan-Asian SNP Consortium (PASNP) has generated a genetic resource of almost 55,000 autosomal single nucleotide polymorphisms (SNPs) across more than 1,800 individuals from 73 urban and indigenous populations in Asia. This has offered valuable insights into the correlation between the genetic ancestry of these populations with major linguistic systems and geography. Here, we attempt to understand whether adaptation to local climate, diet and environment partly explains the genetic variation present in these populations by investigating the genomic signatures of positive selection. RESULTS: To evaluate the impact to the selection analyses due to the considerably lower SNP density as compared to other population genetics resources such as the International HapMap Project (HapMap) or the Singapore Genome Variation Project, we evaluated the extent of haplotype phasing switch errors and the consistency of selection signals from three haplotype-based approaches (iHS, XP-EHH, haploPS) when the HapMap data is thinned to a similar density as PASNP. We subsequently applied haploPS to detect and characterize positive selection in the PASNP populations, identifying 59 genomics regions that were selected in at least one PASNP populations. A cluster analysis on the basis of these 59 signals showed that indigenous populations such as the Negrito from Malaysia and Philippines, the China Hmong, and the Taiwan Ami and Atayal shared more of these signals. We also reported evidence of a positive selection signal encompassing the beta globin gene in the Taiwan Ami and Atayal that was distinct from the signal in the HapMap Africans, suggesting the possibility of convergent evolution at this locus due to malarial selection. CONCLUSIONS: We established that the lower SNP content of the PASNP data conferred weaker ability to detect signatures of positive selection, but the availability of the new approach haploPS retained modest power. Out of all the populations in PASNP, we identified only 59 signals, suggesting a strong need for high-density population-level genotyping data or sequencing data in order to achieve a comprehensive survey of positive selection in Asian populations.


Subject(s)
Polymorphism, Single Nucleotide , Selection, Genetic , Asia , Haplotypes , Humans
19.
PLoS Genet ; 10(5): e1004377, 2014 May.
Article in English | MEDLINE | ID: mdl-24832686

ABSTRACT

South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.


Subject(s)
Genetic Variation , Genetics, Population , Genome, Human , Haplotypes , Humans , India , Polymorphism, Single Nucleotide
20.
Am J Hum Genet ; 92(6): 866-81, 2013 Jun 06.
Article in English | MEDLINE | ID: mdl-23731540

ABSTRACT

Natural selection is a significant force that shapes the architecture of the human genome and introduces diversity across global populations. The question of whether advantageous mutations have arisen in the human genome as a result of single or multiple mutation events remains unanswered except for the fact that there exist a handful of genes such as those that confer lactase persistence, affect skin pigmentation, or cause sickle cell anemia. We have developed a long-range-haplotype method for identifying genomic signatures of positive selection to complement existing methods, such as the integrated haplotype score (iHS) or cross-population extended haplotype homozygosity (XP-EHH), for locating signals across the entire allele frequency spectrum. Our method also locates the founder haplotypes that carry the advantageous variants and infers their corresponding population frequencies. This presents an opportunity to systematically interrogate the whole human genome whether a selection signal shared across different populations is the consequence of a single mutation process followed subsequently by gene flow between populations or of convergent evolution due to the occurrence of multiple independent mutation events either at the same variant or within the same gene. The application of our method to data from 14 populations across the world revealed that positive-selection events tend to cluster in populations of the same ancestry. Comparing the founder haplotypes for events that are present across different populations revealed that convergent evolution is a rare occurrence and that the majority of shared signals stem from the same evolutionary event.


Subject(s)
Models, Genetic , Selection, Genetic , Software , Chromosomes, Human/genetics , Computer Simulation , Evolution, Molecular , Founder Effect , Gene Frequency , Genetics, Population , Genome, Human , Haplotypes , Humans , Polymorphism, Single Nucleotide , Principal Component Analysis
SELECTION OF CITATIONS
SEARCH DETAIL
...